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(iv) CORRESPONDENCE ADDRESS: 
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(B) TELEFAX: (617) 876-5851 
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2 ) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Thr Asn Thr Pro Glu Gin Asp Arg Tyr Leu Gin Ala Lys Lys Tyr lie 
15 10 15 

Glu Phe Tyr Val Val Val Asp Asn Val Met Tyr Arg Lys Tyr 
20 25 30 

) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Thr Asn Thr Pro Glu Gin Asp Arg Tyr Leu Gin Ala Lys Lys Tyr lie 
15 10 15 

Glu Phe Tyr Val Val Val Asp Asn Val Met Tyr Arg Lys Tyr Thr Gly 
20 25 30 

Lys Leu His Val lie Thr Xaa Xaa Val Tyr Glu Met Asn Ala Leu Asn 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

pi, 

Glu Ala Thr Glu Tyr Glu Tyr Leu Asp Tyr Asp Phe Leu Pro Glu 
15 10 15 

y. j 

(2) INFORMATION FOR SEQ ID NO: 4: 

Hi 
U! 

==j! (i) SEQUENCE CHARACTERISTICS: 

p"> (A) LENGTH: 15 amino acids 

B (B) TYPE: amino acid 

M< (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



r" 

m 

pi a. 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Gin Ala Thr Glu Tyr Glu Tyr Leu Asp Tyr Asp Phe Leu Pro Glu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 78.. 1940 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGTCAATAGG AGAAGAGCTC AGGTTGGCTT GGAAGCAGAA AGAGATTCCT GTCCACCACT 60 

CCAATCCAGG CTCCAAAATG ATCCAAGCTC TCTTGGTAGC TATATGCTTA GCGGTTTTTC 120 

CATATCAAGG GAGCTCTATA ATCCTGGAAT CCGGGAATGT TAATGATTAT GAAGTAGTGT 180 

ATCCACAAAA AGTGCCTGCA TTGTCCAAAG GAGGAGTTCA GAATCCTCAG CCAGAGACCA 240 

AGTATGAAGA TACAATGCAA TATGAATTTC ACGTGAACGG AGAGCCAGTG GTCCTTCACT 300 

TAGAAAGAAA TAAAGGACTT TTTTCAGAAG ATTACACTGA AACTCATTAT GCCCCTGATG 3 60 

GCAGAGAAAT TACAACAAGC TCTCCAGTTC AGGATCACTG CTATTATCAT GGTTACATTC 420 

AGAATGAAGC TGACTCAAGT GCAGTCATCA GTGCATGTGA TGGCTTGAAA GGACATTTCA 480 

AGCATCAAGG GGAGACATAC TTTATTGAGC CCTTGGAGCT TTCTGACAGT GAAGCCCATG 540 

CAATATACAA AGATGAAAAT GTAGAAGAAG AGGAAGAGAT CCCCAAAATC TGTGGGGTTA 600 

CCCAGACTAC TTGGGAATCA GATGAGCCGA TTGAAAAGTC CTCTCAGTTA ACTAATACTC 660 

CTGAACAAGA CAGGTACTTG CAGGCCAAAA AATACATCGA GTTTTACGTG GTTGTGGACA 720 

ATGTAATGTA CMGRAAATAC ACCGGCAAGT TACATGTTAT AACAAGAAGA GTATATGAAA 780 

TGGTCAACGC TTTAAATACG ATGTACAGAC GTTTGAATTT TCACATAGCA CTGATTGGCC 840 

TAGAAATTTG GTCCAACGGA AATGAGATTA ATGTGCAATC AGACGTGCAG GCCACTTTGG 900 

ACTTATTTGG AGAATGGAGA GAAAATAAAT TGCTGCCACG CAAAAGGAAT GATAATGCTC 960 

AGTTACTCAC GAGCACTGAG TTCAATGGAA CTACTACAGG ACTTGGTTAC ATAGGCTCCC 1020 

TCTGTAGTCC GAAGAAATCT GTGGCAGTTG TTCAGGATCA TAGCAAAAGC ACAAGCATGG 1080 

TGGCAATTAC AATGGCCCAT CAGATGGGTC ATAATCTGGG CATGAATGAT GACAGAGCTT 1140 

CCTGTACTTG TGGTTCTAAC AAATGCATTA TGTCTACAAA ATATTATGAA TCTCTTTCTG 1200 



f"i 



Pi! 
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AGTTCAGCTC TTGTAGTGTC CAGGAACATC GGGAGTATCT TCTTAGAGAC AGACCACAAT 1260 

GCATTCTCAA CAAACCCTCG CGCAAAGCTA TTGTTACACC TCCAGTTTGT GGAAATTACT 1320 

TTGTGGAGCG GGGAGAAGAA TGTGACTGTG GCTCTCCTGA GGATTGTCAA AATACCTGCT 1380 

GTGATGCTGC AACTTGTAAA CTGCAACATG AGGCACAGTG TGACTCTGGA GAGTGTTGTG 1440 

AGAAATGCAA ATTTAAGGGA GCAGGAGCAG AATGCCGGGC AGCAAAGAAT GACTGTGACT 1500 

TTCCTGAACT CTGCACTGGC CGATCTGCTA AGTGTCCCAA GGACAGCTTC CAGAGGAATG 1560 

GACATCCATG CCAAAACAAC CAAGGTTACT GCTACAATGG GACATGTCCC ACCTTGACAA 1620 

ACCAATGTGC TACTCTCTGG GGGCCAGGTG CAAAAATGTC TCCAGGTTTA TGTTTTATGT 1680 

TGAACTGGAA TGCCCGAAGT TGTGGCTTGT GCAGAAAGGA AAATGGCAGA AAGATTCTAT 1740 

GTGCAGCAAA GGATGTAAAG TGTGGCAGGT TATTTTGCAA AAAGAAAAAC TCGATGATAT 1800 

GCCACTGCCC ACTCCATCAA AGGACCCAAA TTATGGAATG GTTGCACCTG GAACAAAATG 1860 

TGGAGTTAAA AAGGTGTGCA GAAACAGGCA ATGTGTTAAA GTATAGACAG CCAACTGATC 1920 

AAGCACTGCT TCTCTCAATT TGATTTTGGA GATCCTCCTT CCAGAAGGCT TTCCTCAAGT 1980 

CCAAAGAGAC CCATCTGTCT TTATCCTACT AGTAAATCAC TCTTAGCTTT CAAAAAAAAA 2040 

AAAAGTCGAC 2050 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met lie Gin Ala Leu Leu Val Ala lie Cys Leu Ala Val Phe Pro Tyr 
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Gin Gly Ser Ser He lie Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 



Val Val Tyr Pro Gin Lys Val Pro Ala Leu Ser Lys Gly Gly Val Gin 
35 40 
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Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr Met Gin Tyr Glu Phe 



50 



55 



60 



His Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 

65 70 -, c 

/u '5 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Pro Asp Gly Arg 



85 



90 



95 



Glu lie Thr Thr Ser Ser Pro Val Gin Asp His Cys Tyr Tyr His Gly 



100 



105 



110 



Tyr He Gin Asn Glu Ala Asp Ser Ser Ala Val He Ser Ala Cys Asp 
115 120 125 

Gly Leu Lys Gly His Phe Lys His Gin Gly Glu Thr Tyr Phe He Glu 



130 



135 



140 



Pro Leu Glu Leu Ser Asp Ser Glu Ala His Ala He Tyr Lys Asp Glu 

145 150 ice 

130 155 

Asn Val Glu Glu Glu Glu Glu He Pro Lys He Cys Gly Val Thr Gin 

165 170 175 

Thr Thr Trp Glu Ser Asp Glu Pro He Glu Lys Ser Ser Gin Leu Thr 

180 185 i 9 o 



Asn Thr Pro Glu Gin Asp Arg Tyr Leu Gin Ala Lys Lys Tyr He Glu 



195 



200 



205 



Phe Tyr Val Val Val Asp Asn Val Met Tyr Arg Lys Tyr Thr Gly Lys 
210 215 



220 



Leu His Val He Thr Arg Arg Val Tyr Glu Met Val Asn Ala Leu Asn 
225 230 



235 



240 



Thr Met Tyr Arg Arg Leu Asn Phe His He Ala Leu He Gly Leu Glu 
245 250 



255 
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lie Trp Ser Asn Gly Asn Glu lie Asn Val Gin Ser Asp Val Gin Ala 
260 265 270 



Thr Leu Asp Leu Phe Gly Glu Trp Arg Glu Asn Lys Leu Leu Pro Arg 
275 280 285 

Lys Arg Asn Asp Asn Ala Gin Leu Leu Thr Ser Thr Glu Phe Asn Gly 
290 295 300 

Thr Thr Thr Gly Leu Gly Tyr lie Gly Ser Leu Cys Ser Pro Lys Lys 
305 310 315 320 

Ser Val Ala Val Val Gin Asp His Ser Lys Ser Thr Ser Met Val Ala 
325 330 335 

lie Thr Met Ala His Gin Met Gly His Asn Leu Gly Met Asn Asp Asp 
340 345 350 

Arg Ala Ser Cys Thr Cys Gly Ser Asn Lys Cys lie Met Ser Thr Lys 
355 360 365 

Tyr Tyr Glu Ser Leu Ser Glu Phe Ser Ser Cys Ser Val Gin Glu His 
370 375 380 

Arg Glu Tyr Leu Leu Arg Asp Arg Pro Gin Cys lie Leu Asn Lys Pro 
385 390 395 400 

Ser Arg Lys Ala lie Val Thr Pro Pro Val Cys Gly Asn Tyr Phe Val 
405 410 415 

Glu Arg Gly Glu Glu Cys Asp Cys Gly Ser Pro Glu Asp Cys Gin Asn 
420 425 430 

Thr Cys Cys Asp Ala Ala Thr Cys Lys Leu Gin His Glu Ala Gin Cys 
435 440 445 

Asp Ser Gly Glu Cys Cys Glu Lys Cys Lys Phe Lys Gly Ala Gly Ala 
450 455 460 

Glu Cys Arg Ala Ala Lys Asn Asp Cys Asp Phe Pro Glu Leu Cys Thr 
465 470 475 480 

Gly Arg Ser Ala Lys Cys Pro Lys Asp Ser Phe Gin Arg Asn Gly His 
485 490 495 



Pro Cys Gin Asn Asn Gin Gly Tyr Cys Tyr Asn Gly Thr Cys Pro Thr 
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500 



505 



510 



Leu Thr Asn Gin Cys Ala Thr Leu Trp Gly Pro Gly Ala Lys Met Ser 
515 520 525 

Pro Gly Leu Cys Phe Met Leu Asn Trp Asn Ala Arg Ser Cys Gly Leu 
530 535 540 

Cys Arg Lys Glu Asn Gly Arg Lys lie Leu Cys Ala Ala Lys Asp Val 
545 550 555 560 

Lys Cys Gly Arg Leu Phe Cys Lys Lys Lys Asn Ser Met He Cys His 
565 570 575 

Cys Pro Leu His Gin Arg Thr Gin He Met Glu Trp Leu His Leu Glu 
580 585 590 

Gin Asn Val Glu Leu Lys Arg Cys Ala Glu Thr Gly Asn Val Leu Lys 
595 600 605 

Tyr Arg Gin Pro Thr Asp Gin Ala Leu Leu Leu Ser He 
610 615 620 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GTCGACCAGT CAACAGGAGA AAAGCTCAGG TTGGCTTGGA AGCAGAAAGA GATTCCTGTC 
CACCAGTCCA ATCCAGGCTC CAAAATGATC CAAGCTCTCT TGGTAATTAT ATGCTTAGCG 
GTTTTTCCAT ATCAAGGGAG CTCTATAATC CTGGAATCTG GGAATGTTAA TGATTATGAA 
GTTGTGTATC CACAAAAAGT GCCTGCATTG CTCAAAGGAG GAGTTCAGAA TCCTCAGCCA 
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GAGACCAAGT ATGAAGATAC 
CTTCACTTAG AAAGAAATAA 
CCTGATGGCA GAGAAATTAC 
TACATTCAGA ATGAAGCTGA 
CATTTCAAGC ATCAAGGGGA 
TCTCATGCAA TCTACAAAGA 
GGGGTAACCG AGACTACTTG 
AACACTCCTG AACAAGACGG 
GTGGACAACA GAATGTACAG 
TATGAAATGG TCAACGCTGT 
ATTGGCCTAG AAATTTGGTC 
ACTTTGAAAT CATTTCGAGA 
AACGCTCAGT TACTCACGGG 
GGCACCCTCT GCACTCAGAA 
ATGGTGGCAT CTACAATGGC 
GCTTCCTGTA TTTGCAGTCT 
TTTCAGTTCA GCTCTTGTAG 
CAATGCATTC TCAACAAACC 
TACTTTGTGG AGGAGGGAGA 
TGCTGCGATG CTGCAACTTG 
GATGACTGTG ACTTGCCTGA 
TTGCAGAAGA ATGGACATCC 
CCCACCTTCA CAAACCAATG 



AATGCAATAT GAATTTCAAG 
AGGACTTTTT TCAGAAGATT 
AACAAGCCCT CCGGTTCAGG 
CTCAAGTGCA ATCATCAGTG 
GACATACTTT ATTGAGCCCT 
TGAAAATGTA GAAAACGAGG 
GGAGTCAGAT GAGTCCATCG 
GTACTTGCAG GCCAAAAAAT 
GTATTACAAA CGCAATGAAC 
AAATACGAAG TACAGACCTT 
CAACCATGAT AAGTTTGAAG 
TTGGAGAGAA ACAGTTTTGC 
CATTGACTTC AATGGAACTG 
TTCTGTAGCA GTTGTTCAGG 
CCATGAGTTG GGTCATAATC 
TAGACCATGC ATTATGTCTA 
TGTCCGGGAG TATCGGGAGT 
CTTGAGCACA GATACTGTTT 
AGAATGTGAC TGTGGCTCTC 
TTAGTTTAAG GGAGAAGAAG 
ACTCTGCACT GGCCGATCTG 
ATGTCAAAAC AACAAAGGTT 
TATTGCTCTC ATGGGGACAG 



TGAATGGAGA GCCAGTAGTC 
ACACTGAAAC TCATTATGCC 
ATCACTGCTA TTATCATGGT 
CATGTGATGG CTTGAAAGGA 
TGAAGCTTTT CGACAGTGAA 
ATGAGACCCC CGAAACCTGT 
AAAAGACCTC TCAGTTAACT 
ACATCGAGTT TTACGTGGTT 
CTGCTATAAA AAGAAGAGTA 
TGAAAATTCA CATAACACTG 
TGAAGCCAGT AGCGGGTGCC 
TGCCACGCAA AAGGAATGAT 
TTGTGGGAAT TGCTTACACG 
ATTATAACCG AAAAATAAGC 
TGGGCCTTCA TCATGACGGA 
AGGGACGGAC TGCACCTGCC 
ATCTTCTTAG AGAAAGACCA 
CACCTGCAAT TTGTGGAAAT 
CTGCGGATTG TCAAAGTGCC 
CAGAATGCCG GGCAGCAAAG 
TGGAGTGTCC CACGGACAGC 
ACTGCTACAA TGGGGCATGT 
ATTTTACTGT GAGTCCAGAT 
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GGATGTTTTG ACTTGAACGT GAGAGGGAAT GATGTAAGCC ACTGCAGAAA GGAAAATGGT 1680 

GCAAAGATTC CATGTGCAGC AAAGGATGTA AAGTGTGGCA GGTTATACTG CACAGAGAGA 1740 

GACACAATGT CATGCCGATT CCCACTGGAC CCAGATGGTG TAATGGCTGA ACCTGGAACA 1800 

AAATGTGGAG ATGGAATGGT GTGCAGCAAC GGTCAGTGTG TTAATGTGCA GACAGCCTAC 1860 

TGATCAAGCA CTGGCTTCTC TCAATTTGAT TTTGGAGATC CTCCTTCCAG AACGCTTCCC 1920 

TCAAGTCCAA AGAGACCCAT CTGTCTTTAT CCTACTAGTA AATCACTCTT AGCTTTCAGA 1980 

TGGTATCTAA AATTTATAAT ATTTCTTCTC CATAATTTAA ACTGGTAATC TTTTGCTAAA 2040 

ATCAGACCTT TTCCCTGCCA CAAAGCTCCA TGGTCATGTA CAGCACCAAA GGCTTATTTG 2100 

CGAATAAGAA AAAAAAATGG CAATTTTACA GTTTCCCAAT TGCAATGCAT ATTGAATGCA 2160 

ACAAGCTCTG CCCTTTGAGC TGGCGTATTC AAAGGCAATG CTCCCTCTCC CAAAATTATA 2220 

CGCTGGCTTT CCAAGATGTA GCTGCTTCCA TCAATAAACT ATTCTCATTC TGCAAAAAAA 2280 

AAAAAAAAAA AGTCGAC 2297 
( 2 ) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met lie Gin Ala Leu Leu Val lie lie Cys Leu Ala Val Phe Pro Tyr 
15 10 15 

Gin Gly Ser Ser He He Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 

Val Val Tyr Pro Gin Lys Val Pro Ala Leu Leu Lys Gly Gly Val Gin 

40 



Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr MeC Gin Tyr Glu Phe 
50 55 60 



Gin Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 
65 70 75 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Pro Asp Gly Arg 
85 90 95 

Glu lie Thr Thr Ser Pro Pro Val Gin Asp His Cys Tyr Tyr His Gly 
100 105 110 

Tyr lie Gin Asn Glu Ala Asp Ser Ser Ala lie He Ser Ala Cys Asp 
115 120 125 

Gly Leu Lys Gly His Phe Lys His Gin Gly Glu Thr Tyr Phe He Glu 
130 135 140 

Pro Leu Lys Leu Phe Asp Ser Glu Ser His Ala lie Tyr Lys Asp Glu 
145 150 155 160 

Asn Val Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu 
165 170 175 

Thr Thr Trp Glu Ser Asp Glu Ser He Glu Lys Thr Ser Gin Leu Thr 
180 185 190 

Asn Thr Pro Glu Gin Asp Gly Tyr Leu Gin Ala Lys Lys Tyr He Glu 
195 200 205 

Phe Tyr Val Val Val Asp Asn Arg Met Tyr Arg Tyr Tyr Lys Arg Asn 
210 215 220 

Glu Pro Ala He Lys Arg Arg Val Tyr Glu Met Val Asn Ala Val Asn 
225 230 235 240 

Thr Lys Tyr Arg Pro Leu Lys He His He Thr Leu He Gly Leu Glu 
245 250 255 

He Trp Ser Asn His Asp Lys Phe Glu Val Lys Pro Val Ala Gly Ala 
260 265 270 

Thr Leu Lys Ser Phe Arg Asp Trp Arg Glu Thr Val Leu Leu Pro Arg 
275 280 285 
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Lys Arg Asn Asp Asn Ala Gin Leu Leu Thr Gly lie Asp Phe Asn Gly 
290 295 300 



Thr Val Val Gly He Ala Tyr Thr Gly Thr Leu Cys Thr Gin Asn Ser 
305 310 315 320 

Val Ala Val Val Gin Asp Tyr Asn Arg Lys He Ser Met Val Ala Ser 
325 330 335 

Thr Met Ala His Glu Leu Gly His Asn Leu Gly Leu His His Asp Gly 
340 345 350 

Ala Ser Cys He Cys Ser Leu Arg Pro Cys He Met Ser Lys Gly Arg 
355 360 365 

Thr Ala Pro Ala Phe Gin Phe Ser Ser Cys Ser Val Arg Glu Tyr Arg 
370 375 380 

Glu Tyr Leu Leu Arg Glu Arg Pro Gin Cys He Leu Asn Lys Pro Leu 
385 390 395 400 

Ser Thr Asp Thr Val Ser Pro Ala He Cys Gly Asn Tyr Phe Val Glu 
405 410 415 

Glu Gly Glu Glu Cys Asp Cys Gly Ser Pro Ala Asp Cys Gin Ser Ala 
420 425 430 

Cys Cys Asp Ala Ala Thr Cys 
435 

INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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GTCGACCTCA GGTTGGCTTG GAAGCAGAAA GAGATTCCTA TCCACCACTC CAATCCAGGC 60 

TCCAAAATGA TCCAAGCTCT CTTGGTAGCT ATATGCTTAG CGGTTTTTCC ATATCAAGGG 12 0 

AGCTCTATAA TCCTGGAATC CGGGAATGTT AATGATTATG AAGTAGTGTA TCCACAAAAA 180 

GTGCCTGCAT TGTCCAAAGG AGGAGTTCAG AATCCTCAGC CAGAGACCAA GTATGAAGAT 240 

ACAATGCAAT ATGAATTTCA AGTGAATGGA GAGCCAGTAG TCCTTCACCT AGAAAGAAAT 300 

AAAGGACTTT TTTCAGAAGA TTACACTGAA ACTCATTATG CCTCTGATGG CAGAGAAATT 360 

ACAACAAGCC CACTCGTTCA GGATCACTGC TATTATCATG GTTACATTCA GAATGAAGCT 420 

GACTCAAGTG CAGTCATCAG TGCATGCGAT GGCTTGAAAG GACATTTCGA GCTTCAAGGG 480 

GAGACATACT TTATTGAACC CTTGAAGATT TCCGACAGTG AAGCCCATGC AATCTACAAA 540 

GATGAAAATG TAGAAAACGA GGATGAGACC CCCGAAACCT GTGGGGTAAC CGAGACTACT 600 

TGGGAGTCAG ATGAGTCCAT TGAAAAGACC TCTCAGTTAA CTAACACTCC TGAACAAGAC 660 

AGGTACTTGC AGGCCAAAAA ATACCTCGAG TTTTACGTGG TTGTGGACAA CATAATGTAC 720 

AGGCATTACA AACGCGATAA ACCTGTTATA AAAAGAAGAG TATATGAAAT GATCAACACT 780 

ATGAATATGG TGTACAATCG TTTGAATTTT CACATAGCAC TGATTGGCCT AGAAATTTGG 840 

TCCAACAGAA ATGAGATTAA TGTGCAATCA GACGTGCAGG CCACTTTGGA CTTATTTGGA 900 

GAATGGAGAG AAAAAAAATT GCTGCCACGC AAAAGGAATG ATAATGCTCA GTTACTCACG 960 

GGTATTGACT TCAAAGGAAC TCCTGTAGGA CTTGCTTACA TAGGTTCCAT CTGCAATCCG 1020 

AAGAGTTCTG TAGCAGTTGT TCAGGATTAT AGCAGTAGAA CAAGCATGGT GGCAATTACA 1080 

ATGGCCCATG AGATGGGTCA TAATATGGGC ATTCATCATG ACGGACCTTC CTGTACTTGT 1140 

GGTTCTAACA AATGCGTTAT GTCTACAAGA CGTACTGAAC CTGCCTATCA GTTCAGCTCT 1200 
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TGTAGTGTCC GGGAACATCA GGAGTATCTT CTTAGAGACA GACCACAATG CATTCTCAAC 1260 
AAACCCTTGA GCACAGATAT TGTTTCACCT CCAATTTGTG GAAATAACTT TGTGGAGGTG 1320 
GGAGAAGAAT GTGACTGTGG CTCTCCTGCG GATTGTCAAA GTGCCTGCTG CGACGCTACA 1380 

ACTTGTAAAC TACAACCTCA TGCACAGTGT GACTCCGAAG GGTGTTGTGA GAAATGCAAA 1440 

TTTAAGGGAG CAGGAGCAGA ATGCCGGGCA GCAAAGGATG ACTGTGACTT GCCTGAACTC 1500 

TGCACTGGCC AATCTGCTGA GTGTCCCACA GACATCTTCC AGAGGAATGG ACTTCCATGC 1560 

CAAAACAACG AAGGTTACTG CTACAATGGG AAATGCCCCA TCATGACAAA CCAATGTATT 1620 

GCTCTCCGGG GACCAGGTGT AAAAGTATCT CGAGATAGCT GTTTTACATT GAACCAGAGA 1680 

ACCAGTGGTT GTGGCTTGTG CAGAATGGAA TATGGTAGAA AGATTCCATG TGCAGCAAAG 1740 

GATGTAAAGT GTGGCAGGTT ATTTTGCAAA AAGGGAAACT CGATGATATG CAACTGCTCA 1800 

GTTTCACCAC GTGACCCAAG TTATGGAATG GTTGAACCTG GAACAAAATG TGGAGATGGA 1860 

ATGGTGTGCA GCAACAGGCA GTGTGTTGAT GTGAAGACAG CCTACTGATC AAGCACTGGC 1920 

TTCTCTCAAT TTGATTTTGG AGGTCCTCCT TCCAGAACGC TTCCCTCAAG TCCAAAGAGA 1980 

CCCATCTGTC TTTATCCTAC TAGTAAATCA CTCTTAGCTT TCAGATGGTA TCTAAAATTT 2040 

AAAATATTTC TTCTCCATAA TTTAAACTGG TAATCTTTTG CTAAAATCAG ACCTTTTCCC 2100 

TGCCACAAAG CTCCATGGTC ATGTACAGCA CCAAAGGCTT ATTTGCTAAC AAGAAAAAAA 2160 

ATGGCCATTT TACTGTTTGC CAATTGCAAT TCACATTTAA TGCAACAAGC TCTGCCCTTT 22 20 

GAGCTGGCGT ACTCAAAGGC AATGCTCCCT CTCCCAAAAT TATACGCTGG CTTTCCAAGA 2280 

TGTAGCTGCT TCCATCAATA AACTATTCTC ATTCTGAAAA AAAAAAAAAG TCGAC 2335 
(2) INFORMATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met lie Gin Ala Leu Leu Val Ala lie Cys Leu Ala Val Phe Pro Tyr 
15 10 15 

Gin Gly Ser Ser lie lie Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 

Val Val Tyr Pro Gin Lys Val Pro Ala Leu Ser Lys Gly Gly Val Gin 
35 40 45 

Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr Met Gin Tyr Glu Phe 
50 55 60 

Gin Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 
65 70 75 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Ser Asp Gly Arg 
85 90 95 

Glu lie Thr Thr Ser Pro Leu Val Gin Asp His Cys Tyr Tyr His Gly 
100 105 110 

Tyr lie Gin Asn Glu Ala Asp Ser Ser Ala Val He Ser Ala Cys Asp 
115 120 125 

Gly Leu Lys Gly His Phe Glu Leu Gin Gly Glu Thr Tyr Phe He Glu 
130 135 140 
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Pro Leu Lys lie Ser Asp Ser Glu Ala His Ala lie Tyr Lys Asp Glu 
145 150 155 160 



Asn Val Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu 
165 170 175 

Thr Thr Trp Glu Ser Asp Glu Ser lie Glu Lys Thr Ser Gin Leu Thr 
180 185 190 

Asn Thr Pro Glu Gin Asp Arg Tyr Leu Gin Ala Lys Lys Tyr Leu Glu 
195 200 205 

Phe Tyr Val Val Val Asp Asn lie Met Tyr Arg His Tyr Lys Arg Asp 
210 215 220 

Lys Pro Val lie Lys Arg Arg Val Tyr Glu Met He Asn Thr Met Asn 
225 230 235 240 

Met Val Tyr Asn Arg Leu Asn Phe His He Ala Leu He Gly Leu Glu 
245 250 255 

He Trp Ser Asn Arg Asn Glu He Asn Val Gin Ser Asp Val Gin Ala 
260 265 270 

Thr Leu Asp Leu Phe Gly Glu Trp Arg Glu Lys Lys Leu Leu Pro Arg 
275 280 285 

Lys Arg Asn Asp Asn Ala Gin Leu Leu Thr Gly He Asp Phe Lys Gly 
290 295 300 

Thr Pro Val Gly Leu Ala Tyr He Gly Ser He Cys Asn Pro Lys Ser 
305 310 315 320 

Ser Val Ala Val Val Gin Asp Tyr Ser Ser Arg Thr Ser Met Val Ala 
325 330 335 

He Thr Met Ala His Glu Met Gly His Asn Met Gly He His His Asp 
340 345 350 

Gly Pro Ser Cys Thr Cys Gly Ser Asn Lys Cys Val Met Ser Thr Arg 
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Arg Thr Glu Pro Ala Tyr Gin Phe Ser Ser Cys Ser Val Arg Glu His 
370 375 380 



Gin Glu Tyr Leu Leu Arg Asp Arg Pro Gin Cys lie Leu Asn Lys Pro 
385 390 395 400 

Leu Ser Thr Asp lie Val Ser Pro Pro lie Cys Gly Asn Asn Phe Val 
405 410 415 

Glu Val Gly Glu Glu Cys Asp Cys Gly Ser Pro Ala Asp Cys Gin Ser 
420 425 430 

Ala Cys Cys Asp Ala Thr Thr Cys Lys Leu Gin Pro His Ala Gin Cys 
435 440 445 

Asp Ser Glu Gly Cys Cys Glu Lys Cys Lys Phe Lys Gly Ala Gly Ala 
450 455 460 

Glu Cys Arg Ala Ala Lys Asp Asp Cys Asp Leu Pro Glu Leu Cys Thr 
465 470 475 480 

Gly Gin Ser Ala Glu Cys Pro Thr Asp lie Phe Gin Arg Asn Gly Leu 
485 490 495 

Pro Cys Gin Asn Asn Glu Gly Tyr Cys Tyr Asn Gly Lys Cys Pro lie 
500 505 510 

Met Thr Asn Gin Cys He Ala Leu Arg Gly Pro Gly Val Lys Val Ser 
515 520 525 

Arg Asp Ser Cys Phe Thr Leu Asn Gin Arg Thr Ser Gly Cys Gly Leu 
530 535 540 

Cys Arg Met Glu Tyr Gly Arg Lys He Pro Cys Ala Ala Lys Asp Val 
545 550 555 560 

Lys Cys Gly Arg Leu Phe Cys Lys Lys Gly Asn Ser Met He Cys Asn 
555 570 575 
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Cys Ser Val Ser Pro Arg Asp Pro Ser Tyr Gly Met Val Glu Pro Gly 
580 585 590 

Thr Lys Cys Gly Asp Gly Met Val Cys Ser Asn Arg Gin Cys Val Asp 
595 600 605 

Val Lys Thr Ala Tyr 
610 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGTCAACAGG AGAAAAGCTC AGGTTGGCTT GGAAGCAGAA AGAGATTCCT GTCCACCAGT 60 

CCAATCCAGG CTCCAAAATG ATCCAAGCTC TCTTGGTAAT TATATGCTTA GTGGTTTTTC 120 

CATATCAAGG GAGCTCTATA ATCCTGGAAT CTGGGAATGT TAATGATTAT GAAGTTGTGT 180 

ATCCACAAAA AGTGCCTGCA TTGCTCAAAG GAGGAGTTCA GAATCCTCAG CCAGAGACCA 2 40 

AGTATGAAGA TACAATGCAA TATGAATTTC AAGTGAATGG AGAGCCAGTA GTCCTTCACT 300 

TAGAAAGAAA TAAAGGACTT TTTTCAGAAG ATTACACTGA AACTCATTAT GCCCCTGATG 360 

GCAGAGAAAT TACAACAAGC CCTCCGGTTC AGGATCACTG CTATTATCAT GGTTACATTC 420 

AGAATGAAGC TGACTCAAGT GCAATCATCA GTGCATGTGA TGGCTTGAAA GGACATTTCA 480 
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AGCATCAAGG GGAGACATAC TTTATTGAGC CCTTGAAGCT TTTCGACAGT GAATCCCATG 540 

CAATCTACAA AGATGAAAAT GTAGAAAACG AGGATGAGAC CCCCGAAACC TGTGGGGTAA SOO 

CCGAGACTAC TTGGGAGTCA GATGAGTCCA TCGAAAAGAC CTCTCAGTTA ACTAACACTC 660 

CTGAACAAGA CGGGTACTTG CAGGCCAAAA AATACATCGA GTTTTACGTG GTTGTGGACA 720 

ACAGAATGTA CAGGTATTAC AAACGCAATG AACCTGCTAT AAAAAGAAGA GTATATGAAA 780 

TGGTCAACGC TGTAAATACG TACAGACCTT TGAAAATTCA CATAACACTG ATTGGCCTAG 340 

AAATTTGGTC CAACGATGAT AAGTTTGAAG TGAAGCCAGT AGCGGGTGCC ACTTTGAAAT 900 

CATTTCGAGA TTGGAGAGAA ACAGTTTTGC TGCCACGCAA AAGGAATGAT AACGCTCAGT 960 

TACTCACGGG CATTGACTTC AATGGAACTG TTGTGGGAAT TGCTTACACG GGCACCCTCT 1020 

GCACTCAGAA TTCTGTAGCA GTTGTTCAGG ATTATAACCG AAAAATAAGC ATGGTGGCAT 108 0 

CTACAATGGC CCATGAGTTG GGTCATAATC TGGGCCTTCA TCATGACGGA GCTTCCTGTA 1140 

TTTGCAGTCT TAGACCATGC ATTATGTCTA AGGGACGGAC TGCACCTGCC TTTCAGTTCA 1200 

GCTCTTGTAG TGTCCGGGAG TATCGGGAGT ATCTTCTTAG AGAAAGACCA CAATGCATTC 1260 

TCAACAAACC CTTGAGCACA GATACTGTTT CACCTGCAAT TTGTGGAAAT TACTTTGTGG 1320 

AGGAGGGAGA AGAATGTGAC TGTGGCTCTC CTGCGGATTG TCAAAGTGCC TGCTGCGATG 1380 

CTGCAACTTG TAAGTTTAAG GGAGAAGAAG CAGAATGCCG GGCAGCAAAG GATGACTGTG 1440 

ACTTGCCTGA ACTCTGCACT GGCCGATCTG TGGAGTGTCC CACGGACAGC TTGCAGAGGA 1500 

ATGGACATCC ATGTCAAAAC AACAAAGGTT ACTGCTACAA TGGGGCATGT CCCACCTTCA 1560 

CAAACCAATG TATTGCTCTC ATGGGGACAG ATTTTACTGT GAGTCCAGAT GGATGTTTTG 1620 

ACTTGAACGT GAGAGGGAAT TGATGTAAGC CACTGCAGAA AGGAAAATGG TGCAAAGATT 1680 
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CCATGTGCAG CAAAGGATGT AAAGTGTGGC AGATTATACT GCACAGAGAG AGACACAATG 



1740 



fii 



TCATGCCGAT TCCCACTGGA CCCAGATGGT GTTAATGGCT GAACCTGGAA CAAAATGTGG 1800 

AGATGGAATG GTGTGCAGCA ACGGTCAGTG TGTTAATGTG CAGACAGCCT ACTGATCAAG 1860 

CACTGGCTTC TCTCAATTTG ATTTTGGAGA TCCTCCTTCC AGAACGCTTC CCTCAAGTCC 1920 

AAAGAGACCC ATCTGTCTTT ATCCTACTAG TAAATCACTC TTAGCTTTCA GATGGTATCT 1980 

AAAATTTATA ATATTTCTTC TCCATAATTT AAACTGGTAA TCTTTTGCTA AAATCAGACC 2040 

TTTTCCCTGC CACAAAGCTC CATGGTCATG TACAGCACCA AAGGCTTATT TGCGAATAAG 2100 

AAAAAAAAAT GGCAATTTTA CAGTTTCCCA ATTGCAATGC ATATTGAATG CAACAAGCTC 2160 

TGCCCTTTGA GCTGGCGTAT TCAAAGGCAA TGCTCCCTCT CCCAAAATTA TACGCTGGCT 2220 

TTCCAAGATG TAGCTGCTTC CATCAATAAA CTATTCTCAT TCTGAAAAAA AAAAAAAAAA 2280 

AAGTCGAC 2288 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 521 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Mec lie Gin Ala Leu Leu Val He He Cys Leu Val Val Phe Pro Tyr 
15 10 15 
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Gin Gly Ser Ser lie lie Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 



Val Val Tyr Pro Gin Lys Val Pro Ala Leu Leu Lys Gly Gly Val Gin 
35 40 45 

Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr Met Gin Tyr Glu Phe 
50 55 60 

Gin Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 
65 70 75 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Pro Asp Gly Arg 
85 90 95 

Glu lie Thr Thr Ser Pro Pro Val Gin Asp His Cys Tyr Tyr His Gly 
100 105 110 

Tyr He Gin Asn Glu Ala Asp Ser Ser Ala He He Ser Ala Cys Asp 
115 120 125 

Gly Leu Lys Gly His Phe Lys His Gin Gly Glu Thr Tyr Phe He Glu 
130 135 140 

Pro Leu Lys Leu Phe Asp Ser Glu Ser His Ala He Tyr Lys Asp Glu 
145 150 155 160 

Asn Val Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu 
165 170 175 

Thr Thr Trp Glu Ser Asp Glu Ser He Glu Lys Thr Ser Gin Leu Thr 
180 185 190 

Asn Thr Pro Glu Gin Asp Gly Tyr Leu Gin Ala Lys Lys Tyr He Glu 
195 200 205 

Phe Tyr Val Val Val Asp Asn Arg Met Tyr Arg Tyr Tyr Lys Arg Asn 
210 215 220 

Glu Pro Ala He Lys Arg Arg Val Tyr Glu Met Val Asn Ala Val Asn 
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225 



230 



235 



240 



Thr Tyr Arg Pro Leu Lys lie His lie Thr Leu lie Gly Leu Glu He 
245 250 255 

Trp Ser Asn Asp Asp Lys Phe Glu Val Lys Pro Val Ala Gly Ala Thr 
260 265 270 

Leu Lys Ser Phe Arg Asp Trp Arg Glu Thr Val Leu Leu Pro Arg Lys 
275 280 285 

Arg Asn Asp Asn Ala Gin Leu Leu Thr Gly He Asp Phe Asn Gly Thr 
290 295 300 

Val Val Gly He Ala Tyr Thr Gly Thr Leu Cys Thr Gin Asn Ser Val 
305 310 315 320 

Ala Val Val Gin Asp Tyr Asn Arg Lys He Ser Met Val Ala Ser Thr 
325 330 335 

Met Ala His Glu Leu Gly His Asn Leu Gly Leu His His Asp Gly Ala 
340 345 350 

Ser Cys He Cys Ser Leu Arg Pro Cys He Met Ser Lys Gly Arg Thr 
355 360 365 

Ala Pro Ala Phe Gin Phe Ser Ser Cys Ser Val Arg Glu Tyr Arg Glu 
370 375 380 

Tyr Leu Leu Arg Glu Arg Pro Gin Cys He Leu Asn Lys Pro Leu Ser 
385 390 395 400 

Thr Asp Thr Val Ser Pro Ala He Cys Gly Asn Tyr Phe Val Glu Glu 
405 410 415 

Gly Glu Glu Cys Asp Cys Gly Ser Pro Ala Asp Cys Gin Ser Ala Cys 
420 425 430 

Cys Asp Ala Ala Thr Cys Lys Phe Lys Gly Glu Glu Ala Glu Cys Arg 
435 440 445 
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Ala Ala Lys Asp Asp Cys Asp Leu Pro Glu Leu Cys Thr Gly Arg Ser 
450 455 460 



Val Glu Cys Pro Thr Asp Ser Leu Gin Arg Asn Gly His Pro Cys Gin 
465 470 475 480 

Asn Asn Lys Gly Tyr Cys Tyr Asn Gly Ala Cys Pro Thr Phe Thr Asn 
485 490 495 

Gin Cys He Ala Leu Met Gly Thr Asp Phe Thr Val Ser Pro Asp Gly 
500 505 510 

Cys Phe Asp Leu Asn Val Arg Gly Asn 
515 520 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCGACGTCA ACAGGAGAAA AGCTCAGGTT GGCTTGGAAG CAGAAAGAGA TTCCTGTCCA 
CCAGTCCAAT CCAGGCTCCA AAATGATCCA AGCTCTCTTG GTAATTATAT GCTTAGTGGT 
TTTTCCATAT CAAGGGAGCT CTATAATCCT GGAATCTGGG AATGTTAATG ATTATGAAGT 
TGTGTATCCA CAAAAAGTGC CTGCATTGCT CAAAGGAGGA GTTCAGAATC CTCAGCCAGA 
GACCAAGTAT GAAGATACAA TGCAATATGA ATTTCAAGTG AATGGAGAGC CAGTAGTCCT 
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TCACTTAGAA AGAAATAAAG GACTTTTTTC AGAAGATTAC ACTGAAACTC ATTATGCCCC 360 

TGATGGCAGA GAAATTACAA CAAGCCCTCC GGTTCAGGAT CACTGCTATT ATCATGGTTA 420 

CATTCAGAAT GAAGCTGACT CAAGTGCAAT CATCAGTGCA TGTGATGGCT TGAAAGGACA 480 

TTTCAAGCAT CAAGGGGAGA CATACTTTAT TGAGCCCTTG AAGCTTTTCG ACAGTGAATC 540 

CCATGCAATC TACAAAGATG AAAATGTAGA AAACGAGGAT GAGACCCCCG AAACCTGTGG 600 

GGTAACCGAG ACTACTTGGG AGTCAGATGA GTCCATCGAA AAGACCTCTC AGTTAACTAA 660 

f«; CACTCCTGAA CAAGACGGGT ACTTGCAGGC CAAAAAATAC ATCGAGTTTT ACGTGGTTGT 720 

. 

feW GGACAACAGA ATGTACAGGT ATTACAAACG CAATGAACCT GCTATAAAAA GAAGAGTATA 7 80 

Hi 

fl=i TGAAATGGTC AACGCTGTAA ATACGAAGTA CAGACCTTTG AAAATTCACA TAACACTGAT 840 

ffi 

Q TGGCCTAGAA ATTTGGTCCA ACGATGATAA GTTTGAAGTG AAGCCAGTAG CGGGTGCCAC 900 

fail 

y TTTGAAATCA TTTCGAGATT GGAGAGAAAC AGTTTTGCTG CCACGCAAAA GGAATGATAA 960 

! ill 

W CGCTCAGTTA CTCACGGGCA TTGACTTCAA TGGAACTGTT GTGGGAATTG CTTACACGGG 1020 

CACCCTCTGC ACTCAGAATT CTGTAGCAGT TGTTCAGGAT TATAACCGAA AAATAAGCAT 10 80 

GGTGGCATCT ACAATGGCCC ATGAGTTGGG TCATAATCTG GGCCTTCATC ATGACGGAGC 1140 

TTCCTGTATT TGCAGTCTTA GACCATGCAT TATGTCTAAG GGACGGACTG CACCTGCCTT 1200 

TCAGTTCAGC TCTTGTAGTG TCCGGGAGTA TCGGGAGTAT CTTCTTAGAG AAAGACCACA 1260 

ATGCATTCTC AACAAACCCT TGAGCACAGA TACTGTTTCA CCTGCAATTT GTGGAAATTA 1320 

CTTTGTGGAG GAGGGAGAAG AATGTGACTG TGGCTCTCCT GCGGATTGTC AAAGTGCCTG 13 80 

CTGCGATGCT GCAACTTGTA AGTTTAAGGG AGAAGAAGCA GAATGCCGGG CAGCAAAGGA 1440 

TGACTGTGAC TTGCCTGAAC TCTGCACTGG CCGATCTGTG GAGTGTCCCA CGGACAGCTT 1500 
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GCAGAGGAAT GGACATCCAT GTCAAAACAA CAAAGGTTAC TGCTACAATG GGGCATGTCC 1560 

CACCTTCACA AACCAATGTA TTGCTCTCAT GGGGACAGAT TTTACTGTGA GTCCAGATGG 1620 

ATGTTTTGAC TTGAACGTGA GAGGGAATGA TGTAAGCCAC TGCAGAAAGG AAAATGGTGC 1680 

AAAGATTCCA TGTGCAGCAA AGGATGTAAA GTGTGGCAGG TTATACTGCA CAGAGAGAGA 1740 

CACAATGTCA TGCCGATTCC CACTGGACCC AGATGGTGTA ATGGCTGAAC CTGGAACAAA 1800 

ATGTGGAGAT GGAATGGTGT GCAGCAACGG TCAGTGTGTT AATGTGCAGA CAGCCTACTG 1860 

ATCAAGCACT GGCTTCTCTC AATTTGATTT TGGAGATCCT CCTTCCAGAA CGCTTCCCTC 1920 



AAGTCCAAAG AGACCCATCT GTCTTTATCC TACTAGTAAA TCACTCTTAG CTTTCAGATG 1980 

€> 
ffl 

Jj,^ GTATCTAAAA TTTATAATAT TTCTTCTCCA TAATTTAAAC TGGTAATCTT TTGCTAAAAT 2040 

Fi | 

Q CAGACCTTTT CCCTGCCACA AAGCTCCATG GTCATGTACA GCACCAAAGG CTTATTTGCG 2100 

f'' AATAAGAAAA AAAAATGGCA ATTTTACAGT TTCCCAATTG CAATGCATAT TGAATGCAAC 2160 

AAGCTCTGCC CTTTGAGCTG GCGTATTCAA AGGCAATGCT CCCTCTCCCA AAATTATACG 2220 
CTGGCTTTCC AAGATGTAGC TGCTTCCATC AATAAACTAT TCTCATTCTG AAAAAAAAAA 2280 
AAAAAAAAAA AAAAAAAAAA AAAGTCGAC 2309 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met lie Gin Ala Leu Leu Val He He Cys Leu Val Val Phe Pro Tyr 
1 5 10 15 

Gin Gly Ser Ser He He Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 

Val Val Tyr Pro Gin Lys Val Pro Ala Leu Leu Lys Gly Gly Val Gin 
35 40 45 

Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr Met Gin Tyr Glu Phe 
50 55 60 

Gin Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 
65 70 75 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Pro Asp Gly Arg 
85 90 95 

Glu He Thr Thr Ser Pro Pro Val Gin Asp His Cys Tyr Tyr His Gly 
100 105 no 

Tyr He Gin Asn Glu Ala Asp Ser Ser Ala He He Ser Ala Cys Asp 
115 120 125 

Gly Leu Lys Gly His Phe Lys His Gin Gly Glu Thr Tyr Phe He Glu 
130 135 140 

Pro Leu Lys Leu Phe Asp Ser Glu Ser His Ala He Tyr Lys Asp Glu 
145 150 155 160 

Asn Val Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu 
165 170 175 

Thr Thr Trp Glu Ser Asp Glu Ser He Glu Lys Thr Ser Gin Leu Thr 
180 185 190 

Asn Thr Pro Glu Gin Asp Gly Tyr Leu Gin Ala Lys Lys Tyr He Glu 
195 200 205 
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Phe Tyr Val Val Val Asp Asn Arg Met Tyr Arg Tyr Tyr Lys Arg Asn 
210 215 220 



Glu Pro Ala He Lys Arg Arg Val Tyr Glu Met Val Asn Ala Val Asn 
225 230 235 240 

Thr Lys Tyr Arg Pro Leu Lys He His He Thr Leu He Gly Leu Glu 
245 250. 255 

He Trp Ser Asn Asp Asp Lys Phe Glu Val Lys Pro Val Ala Gly Ala 
250 265 270 

Thr Leu Lys Ser Phe Arg Asp Trp Arg Glu Thr Val Leu Leu Pro Arg 
275 280 285 

Lys Arg Asn Asp Asn Ala Gin Leu Leu Thr Gly He Asp Phe Asn Gly 
290 295 300 

Thr Val Val Gly He Ala Tyr Thr Gly Thr Leu Cys Thr Gin Asn Ser 
305 310 315 320 

Val Ala Val Val Gin Asp Tyr Asn Arg Lys He Ser Met Val Ala Ser 
325 330 335 

Thr Met Ala His Glu Leu Gly His Asn Leu Gly Leu His His Asp Gly 
340 345 350 

Ala Ser Cys He Cys Ser Leu Arg Pro Cys He Met Ser Lys Gly Arg 
355 360 365 

Thr Ala Pro Ala Phe Gin Phe Ser Ser Cys Ser Val Arg Glu Tyr Arg 
370 375 380 

Glu Tyr Leu Leu Arg Glu Arg Pro Gin Cys He Leu Asn Lys Pro Leu 
385 390 395 400 

Ser Thr Asp Thr Val Ser Pro Ala He Cys Gly Asn Tyr Phe Val Glu 
405 410 415 



Glu Gly Glu Glu Cys Asp Cys Gly Ser Pro Ala Asp Cys Gin Ser Ala 
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420 



425 



430 



Cys Cys Asp Ala Ala Thr Cys Lys Phe Lys Gly Glu Glu Ala Glu Cys 
435 440 445 

Arg Ala Ala Lys Asp Asp Cys Asp Leu Pro Glu Leu Cys Thr Gly Arg 
450 455 460 

Ser Val Glu Cys Pro Thr Asp Ser Leu Gin Arg Asn Gly His Pro Cys 
465 470 475 480 



.ii, 
"sis' 

en 
m 
o 



Gin Asn Asn Lys Gly Tyr Cys Tyr Asn Gly Ala Cys Pro Thr Phe Thr 
485 490 495 

Asn Gin Cys lie Ala Leu Met Gly Thr Asp Phe Thr Val Ser Pro Asp 
500 505 510 

Gly Cys Phe Asp Leu Asn Val Arg Gly Asn Asp Val Ser His Cys Arg 
515 520 525 



feat: 



Lys Glu Asn Gly Ala Lys He Pro Cys Ala Ala Lys Asp Val Lys Cys 
530 535 540 



Gly Arg Leu Tyr Cys Thr Glu Arg Asp Thr Met Ser Cys Arg Phe Pro 

545 550 555 560 

Leu Asp Pro Asp Gly Val Met Ala Glu Pro Gly Thr Lys Cys Gly Asp 

565 570 575 



Gly Met Val Cys Ser Asn Gly Gin Cys Val Asn Val Gin Thr Ala Tyr 
580 585 590 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

GTCGACGACA TTTCAAGCAT CAAGGGGAGA CATACTTTAT TGAGCCCTTG AAGCTTTTCG 60 

ACAGTGAATC CCATGCAATC TACAAAGATG AAAATGTAGA AAACGAGGAT GAGACCCCCG 12 0 

AAACCTGTGG GGTAACCGAG ACTACTTGGG AGTCAGATGA GTCCATTGAA AAGACCTCTC 180 

AGTTAACTAA CACTCCTGAA CAAGACGGGT ACTTGCAGGC CAAAAAATAC ATCGAGTTTT 240 

. 

Mj! ACGTGGTTGT GGACAACAGA ATGTACAGGT ATTACAAACG CAATGAACCT GCTATAAAAA 300 



Ui 

fu 



FU 



GAAGAGTATA TGAAATGGTC AACGCTGTAA ATACGAAGTA CAGACCTTTG AAAATTCACA 360 

TAACACTGAT TGGCCTAGAA ATTTGGTCCA ACGATGATAA GTTTGAAGTG AAGCCAGTAG 420 

CGGGTGCCAC TTTGAAATCA TTTCGAGATT GGAGAGAAAC AGTTTTGCTG CCACGCAAAA 480 

GGAATGATAA CGCTCAGTTA CTCACGGGCA TTGACTTCAA TGGAACTGTT GTGGGAATTG 540 

CTTACACGGG CACCCTCTGC ACTCAGAATT CTGTAGCAGT TGTTCAGGAT TATAACCGAA 600 

AAATAAGCAT GGTGGCATCT ACAATGGCCC ATGAGTTGGG TCATAATCTG GGCATTCATC 660 

ATGACGGAGC TTCCTGTATT TGCAGTCTTA AACCATGCAT TATGTCTAAG GGACGGACTG 720 

CACCTGCCTT TCAGTTCAGC TCTTGTAGTG TCCGGGAGTA TCGGGAGTAT CTTCTTAGAA 7 80 

AAAGACCACA ATGCATTCTC AACAAACCCT TGAGCACAGA TATTGTTTCA CCTGCAATTT 340 

GTGGAAATTA CTTTGTGGAG GAGGGAGAAG AATGTGACTG TGGCTCTCCT GCGGATTGTC 900 

AAAGTGCCTG CTGCAATGCT GCAACTTGTA AGTTTAAGGG AGAAGAAGCA GAATGCCGGG 960 

CAGCAAAGGA TGACTGTGAC TTGCCTGAAC TCTGCACTGG CCGATCTGTG GAGTGTCCCA 1020 
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. i's 



CGGACAGCTT GCAGAGGAAT GGACATCCAT GTCAAAACAA CAAAGGTTAC TGCTACAATG 1080 

GGGCATGTCC CACCTTCACA AACCAATGTA TTGCTCTCAT GGGGACAGAT TTTACTGTGA 1140 

GTCCAGATGG ATGTTTTGAC TTGAACGTGA GAGGGAATGA TGTAAGCCAC TGCAGAAAGG 1200 

AAAATGGTGC AAAGATTCCA TGTGCAGCAA AGGATGTAAA GTGTGGCAGG TTATACTGCA 1260 

CAGAGAGAAA CACAATGTCA TGCCGATTCC CACTGGACCC AGATGGTGTA ATGGCTGAAC 1320 

CTGGAACAAA ATGTGGAGAT GGAATGGTGT GCAGCAACGG TCAGTGTGTT AATGTGCAGA 1380 

CAGCCTACTG ATCAAGCACT GGCTTCTCTC AATTTGATTT TGGAGATCCT CCTTCCAGAA 1440 

CGCTTCCCTC AAGTCCAAAG AGACCCATCT GTCTTTATCC TACTAGTAAA TCACTCTTAG 1500 



S'i CTTTCAGATG GTATCTAAAA TTTATAATAT TTCTTCTCCA TAATTTAAAC TGGTAATCTT 1560 

W 
f"i 

TTGCTAAAAT CAGACCTTTT CCCTGCCACA AAGCTCCATG GTCATGTACA GTACCAAAGG 1620 

a 

i - 

H 1 CTTATTTGCT AACACGAAAA AAAATGGCCA TTTTACCGTT TGCCAATTGC AATTCACATT 1680 

Fi ! 
i w 

Jl! TAATGCAACA AGCTCTGCCC TTTGAGCTGG CGTATTCAAA GGCAATGCTC CCTCTCCCAA 1740 

was- 1 

AATTATATGC TGGCTTTCCA AGATGTAGCT GCTTCCATCA ATAAACTATT CTCATTCTGA 1800 
AAAAAAAAAA AAAAGTCGAC 182 q 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Arg Arg His Phe Lys His Gin Gly Glu Thr Tyr Phe He Glu Pro Leu 
15 10 15 

Lys Leu Phe Asp Ser Glu Ser His Ala He Tyr Lys Asp Glu Asn Val 
20 25 30 

Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu Thr Thr 
35 40 45 

Trp Glu Ser Asp Glu Ser He Glu Lys Thr Ser Gin Leu Thr Asn Thr 
50 55 60 

Pro Glu Gin Asp Gly Tyr Leu Gin Ala Lys Lys Tyr He Glu Phe Tyr 
65 70 75 80 

Val Val Val Asp Asn Arg Met Tyr Arg Tyr Tyr Lys Arg Asn Glu Pro 
85 90 95 

Ala He Lys Arg Arg Val Tyr Glu Met Val Asn Ala Val Asn Thr Lys 
100 105 110 

Tyr Arg Pro Leu Lys He His He Thr Leu He Gly Leu Glu He Trp 
115 120 125 

Ser Asn Asp Asp Lys Phe Glu Val Lys Pro Val Ala Gly Ala Thr Leu 
130 135 140 

Lys Ser Phe Arg Asp Trp Arg Glu Thr Val Leu Leu Pro Arg Lys Arg 
145 150 155 160 

Asn Asp Asn Ala Gin Leu Leu Thr Gly He Asp Phe Asn Gly Thr Val 
165 170 175 

Val Gly He Ala Tyr Thr Gly Thr Leu Cys Thr Gin Asn Ser Val Ala 
180 185 190 



Val Val Gin Asp Tyr Asn Arg Lys He Ser Met Val Ala Ser Thr Met 
195 200 205 
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Ala His Glu Leu Gly His Asn Leu Gly lie His His Asp Gly Ala Ser 
210 215 220 



Cys He Cys Ser Leu Lys Pro Cys He Met Ser Lys Gly Arg Thr Ala 
225 230 235 240 

Pro Ala Phe Gin Phe Ser Ser Cys Ser Val Arg Glu Tyr Arg Glu Tyr 
245 250 255 

Leu Leu Arg Lys Arg Pro Gin Cys He Leu Asn Lys Pro Leu Ser Thr 
260 265 270 

Asp He Val Ser Pro Ala He Cys Gly Asn Tyr Phe Val Glu Glu Gly 
275 280 285 

Glu Glu Cys Asp Cys Gly Ser Pro Ala Asp Cys Gin Ser Ala Cys Cys 
290 295 300 

Asn Ala Ala Thr Cys Lys Phe Lys Gly Glu Glu Ala Glu Cys Arg Ala 
305 310 315 320 

Ala Lys Asp Asp Cys Asp Leu Pro Glu Leu Cys Thr Gly Arg Ser Val 
325 330 335 

Glu Cys Pro Thr Asp Ser Leu Gin Arg Asn Gly His Pro Cys Gin Asn 
340 345 350 

Asn Lys Gly Tyr Cys Tyr Asn Gly Ala Cys Pro Thr Phe Thr Asn Gin 
355 360 365 

Cys He Ala Leu Met Gly Thr Asp Phe Thr Val Ser Pro Asp Gly Cys 
370 375 380 

Phe Asp Leu Asn Val Arg Gly Asn Asp Val Ser His Cys Arg Lys Glu 
385 390 395 400 

Asn Gly Ala Lys He Pro Cys Ala Ala Lys Asp Val Lys Cys Gly Arg 
405 410 415 

Leu Tyr Cys Thr Glu Arg Asn Thr Met Ser Cys Arg Phe Pro Leu Asp 
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pS7t 

|1 ! 



420 425 430 

Pro Asp Gly Val Met Ala Glu Pro Gly Thr Lys Cys Gly Asp Gly Met 
435 440 445 

Val Cys Ser Asn Gly Gin Cys Val Asn Val Gin Thr Ala Tyr 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTCGACCTCA GGTTGGCTTG GAAGCAGAAA GAGATTCCTA TCCACCACTC CAATCCAGGC 60 

TCCAAAATGA TCCAAGCTCT CTTGGTAGCT ATATGCTTAG CGGTTTTTCC ATATCAAGGG 120 

AGCTCTATAA TCCTGGAATC CGGGAATGTT AATGATTATG AAGTAGTGTA TCCACAAAAA 180 

GTGCCTGCAT TGTCCAAAGG AGGAGTTCAG AATCCTCAGC CAGAGACCAA GTATGAAGAT 240 

ACAATGCAAT ATGAATTTCA AGTGAATGGA GAGCCAGTAG TCCTTCACCT AGAAAGAAAT 300 

AAAGGACTTT TTTCAGAAGA TTACACTGAA ACTCATTATG CCTCTGATGG CAGAGAAATT 360 

ACAACAAGCC CACTCGTTCA GGATCACTGC TATTATCATG GTTACATTCA GAATGAAGCT 420 
GACTCAAGTG CAGTCATCAG TGCATGCGAT GGCTTGAAAG GACATTTCGA GCTTCAAGGG > 480 

GAGACATACT TTATTGAACC CTTGAAGATT TCCGACAGTG AAGCCCATGC AATCTACAAA 540 

GATGAAAATG TAGAAAACGA GGATGAGACC CCCGAAACCT GTGGGGTAAC CGAGACTACT 600 

TGGGAGTCAG ATGAGTCCAT TGAAAAGACC TCTCAGTTAG ACGACGACGA CAAGCGGCCG 660 
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CCAACTAACA CTCCTGAACA AGACAGGTAC TTGCAGGCCA AAAAATACCT CGAGTTTTAC 
GTGGTTGTGG ACAACATAAT GTACAGGCAT TACAAACGCG ATAAACCTGT TATAAAAAGA 
AGAGTATATG AAATGATCAA CACTATGAAT ATGGTGTACA ATCGTTTGAA TTTTCACATA 



720 



780 



840 
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GCACTGATTG GCCTAGAAAT TTGGTCCAAC AGAAATGAGA TTAATGTGCA ATCAGACGTG 900 

CAGGCCACTT TGGACTTATT TGGAGAATGG AGAGAAAAAA AATTGCTGCC ACGCAAAAGG 960 

AATGATAATG CTCAGTTACT CACGGGTATT GACTTCAAAG GAACTCCTGT AGGACTTGCT 1020 

TACATAGGTT CCATCTGCAA TCCGAAGAGT TCTGTAGCAG TTGTTCAGGA TTATAGCAGT 1080 

AGAACAAGCA TGGTGGCAAT TACAATGGCC CATGAGATGG GTCATAATAT GGGCATTCAT 1140 

CATGACGGAC CTTCCTGTAC TTGTGGTTCT AACAAATGCG TTATGTCTAC AAGACGTACT 1200 

GAACCTGCCT ATCAGTTCAG CTCTTGTAGT GTCCGGGAAC ATCAGGAGTA TCTTCTTAGA 1260 

GACAGACCAC AATGCATTCT CAACAAACCC TTGAGCACAG ATATTGTTTC ACCTCCAATT 1320 

TGTGGAAATA ACTTTGTGGA GGTGGGAGAA GAATGTGACT GTGGCTCTCC TGCGGATTGT 1380 

CAAAGTGCCT GCTGCGACGC TACAACTTGT AAACTACAAC CTCATGCACA GTGTGACTCC 1440 

GAAGGGTGTT GTGAGAAATG CAAATTTAAG GGAGCAGGAG CAGAATGCCG GGCAGCAAAG 1500 

GATGACTGTG ACTTGCCTGA ACTCTGCACT GGCCAATCTG CTGAGTGTCC CACAGACATC 1560 

TTCCAGAGGA ATGGACTTCC ATGCCAAAAC AACGAAGGTT ACTGCTACAA TGGGAAATGC 1620 

CCCATCATGA CAAACCAATG TATTGCTCTC CGGGGACCAG GTGTAAAAGT ATCTCGAGAT 1680 

AGCTGTTTTA CATTGAACCA GAGAACCAGT GGTTGTGGCT TGTGCAGAAT GGAATATGGT 1740 

AGAAAGATTC CATGTGCAGC AAAGGATGTA AAGTGTGGCA GGTTATTTTG CAAAAAGGGA 1800 

AACTCGATGA TATGCAACTG CTCAGTTTCA CCACGTGACC CAAGTTATGG AATGGTTGAA 1860 

CCTGGAACAA AATGTGGAGA TGGAATGGTG TGCAGCAACA GGCAGTGTGT TGATGTGAAG 1920 

ACAGCCTACT GATCAAGCAC TGGCTTCTCT CAATTTGATT TTGGAGGTCC TCCTTCCAGA 1980 



ACGCTTCCCT CAAGTGCAAA GAGACCCATC TGTCTTTATC CTACTAGTAA ATCACTCTTA 2040 
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GCTTTCAGAT GGTATCTAAA ATTTAAAATA TTTCTTCTCC ATAATTTAAA CTGGTAATCT 2100 

TTTGCTAAAA TCAGACCTTT TCCCTGCCAC AAAGCTCCAT GGTCATGTAC AGCACCAAAG 2160 

GCTTATTTGC TAACAAGAAA AAAAATGGCC ATTTTACTGT TTGCCAATTG CAATTCACAT 2220 

TTAATGCAAC AAGCTCTGCC CTTTGAGCTG GCGTACTCAA AGGCAATGCT CCCTCTCCCA 2280 

AAATTATACG CTGGCTTTCC AAGATGTAGC TGCTTCCATC AATAAACTAT TCTCATTCTG 2340 

AAAAAAAAAA AAAGTCGAC 2359 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

H • 
Ms 

fjj 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

pi; 

Met lie Gin Ala Leu Leu Val Ala lie Cys Leu Ala Val Phe Pro Tyr 
15 10 15 

Gin Gly Ser Ser lie lie Leu Glu Ser Gly Asn Val Asn Asp Tyr Glu 
20 25 30 

Val Val Tyr Pro Gin Lys Val Pro Ala Leu Ser Lys Gly Gly Val Gin 
35 40 45 

Asn Pro Gin Pro Glu Thr Lys Tyr Glu Asp Thr Met Gin Tyr Glu Phe 
50 55 60 

Gin Val Asn Gly Glu Pro Val Val Leu His Leu Glu Arg Asn Lys Gly 
65 70 75 80 

Leu Phe Ser Glu Asp Tyr Thr Glu Thr His Tyr Ala Ser Asp Gly Arg 
85 90 95 

Glu lie Thr Thr Ser Pro Leu Val Gin Asp His Cys Tyr Tyr His Gly 
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Tyr He Gin Asn Glu Ala Asp Ser Ser Ala Val He Ser Ala Cys Asp 
115 120 125 



Gly Leu Lys Gly His Phe Glu Leu Gin Gly Glu Thr Tyr Phe lie Glu 
130 135 140 



Pro Leu Lys He Ser Asp Ser Glu Ala His Ala He Tyr Lys Asp Glu 
145 150 155 160 

Asn Val Glu Asn Glu Asp Glu Thr Pro Glu Thr Cys Gly Val Thr Glu 
165 170 175 

Thr Thr Trp Glu Ser Asp Glu Ser He Glu Lys Thr Ser Gin Leu Asp 
180 185 190 



Asp Asp Asp Lys Arg Pro Pro Thr Asn Thr Pro Glu Gin Asp Arg Tyr 
195 200 205 

Leu Gin Ala Lys Lys Tyr Leu Glu Phe Tyr Val Val Val Asp Asn He 
210 215 220 



Met Tyr Arg His Tyr Lys Arg Asp 
225 230 

Tyr Glu Met He Asn Thr Met Asn 
245 

His He Ala Leu He Gly Leu Glu 
260 

Asn Val Gin Ser Asp Val Gin Ala 
275 280 

Arg Glu Lys Lys Leu Leu Pro Arg 
290 295 



Lys Pro Val lie Lys Arg Arg Val 
235 240 

Met Val Tyr Asn Arg Leu Asn Phe 
25.0 255 

He Trp Ser Asn Arg Asn Glu He 
265 270 

Thr Leu Asp Leu Phe Gly Glu Trp 
285 

Lys Arg Asn Asp Asn Ala Gin Leu 
300 



Leu Thr Gly He Asp Phe Lys Gly Thr Pro Val Gly Leu Ala Tyr He 
305 310 315 320 

Gly Ser He Cys Asn Pro Lys Ser Ser Val Ala Val Val Gin Asp Tyr 
325 330 335 

Ser Ser Arg Thr Ser Met Val Ala He Thr Met Ala His Glu Met Gly 
340 345 350 



66 



His Asn Met Gly lie His His Asp Gly Pro Ser Cys Thr Cys Gly Ser 
355 360 365 

Asn Lys Cys Val Met Ser Thr Arg Arg Thr Glu Pro Ala Tyr Gin Phe 
370 375 380 

Ser Ser Cys Ser Val Arg Glu His Gin Glu Tyr Leu Leu Arg Asp Arg 
385 390 395 400 

Pro Gin Cys lie Leu Asn Lys Pro Leu Ser Thr Asp lie Val Ser Pro 
405 410 415 

Pro lie Cys Gly Asn Asn Phe Val Glu Val Gly Glu Glu Cys Asp Cys 
420 425 430 

, :•=< Gly Ser Pro Ala Asp Cys Gin Ser Ala Cys Cys Asp Ala Thr Thr Cys 

lb"- 435 440 445 

w 

Q'l Lys Leu Gin Pro His Ala Gin Cys Asp Ser Glu Gly Cys Cys Glu Lys 

Hi 450 455 460 

SS11. 

M 

¥ Cys Lys Phe Lys Gly Ala Gly Ala Glu Cys Arg Ala Ala Lys Asp Asp 

^ 465 470 475 480 
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Cys Asp Leu Pro Glu Leu Cys Thr Gly Gin Ser Ala Glu Cys Pro Thr 
485 490 495 

Asp lie Phe Gin Arg Asn Gly Leu Pro Cys Gin Asn Asn Glu Gly Tyr 
500 505 510 

Cys Tyr Asn Gly Lys Cys Pro lie Met Thr Asn Gin Cys lie Ala Leu 
515 520 525 

Arg Gly Pro Gly Val Lys Val Ser Arg Asp Ser Cys Phe Thr Leu Asn 
530 535 540 

Gin Arg Thr Ser Gly Cys Gly Leu Cys Arg Met Glu Tyr Gly Arg Lys 

545 550 555 560 

lie Pro Cys Ala Ala Lys Asp Val Lys Cys Gly Arg Leu Phe Cys Lys 
565 570 575 

Lys Gly Asn Ser Met lie Cys Asn Cys Ser Val Ser Pro Arg Asp Pro 
580 585 590 

Ser Tyr Gly Met Val Glu Pro Gly Thr Lys Cys Gly Asp Gly Met Val 
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Cys Ser Asn Arg Gin Cys Val Asp Val Lys Thr Ala Tyr 
610 615 620 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACNCCNGARC ARGAY 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
RTAYTTYCKR TACAT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CAGGACAGGT ACTTGCAGGC CAAA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.22: 
ATCGAGTTTT ACGTGGTTGT GGAC 
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